Mining pure linguistic associations from numerical data

Authors

  • Vilém Novák
  • Irina Perfilieva
  • Antonín Dvorák
  • Guoqing Chen
  • Qiang Wei
  • Peng Yan
Abstract

This paper presents a method for directly mining associations from numerical data. Because the associations are expressed in natural language, we call them “linguistic associations”. They are composed of evaluative linguistic expressions such as “small”, “very big”, or “roughly medium”. The main idea is to evaluate real-valued data by the corresponding linguistic expressions and then search for associations using a standard data-mining technique (we have used the GUHA method). An essential outcome of our theory is the high understandability of the found associations: formulated in natural language, they are much closer to the way experts from various fields think. Moreover, associations characterizing real dependencies can be taken directly as fuzzy IF–THEN rules and used as expert knowledge about the problem. © 2007 Elsevier Inc. All rights reserved.
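
The two-step idea in the abstract (label real values with linguistic expressions, then search for associations) can be illustrated with a minimal Python sketch. This is not the authors' implementation and it does not use the GUHA method: the three labels, the piecewise-linear membership shapes, the support and confidence thresholds, and the names `label` and `mine_associations` are all assumptions made for illustration.

```python
from collections import Counter


def label(value, low, high):
    """Return the best-fitting label for value within the context [low, high].

    Assumption: three labels with simple piecewise-linear membership
    functions on the normalized scale; the paper's evaluative expressions
    (e.g. "very big", "roughly medium") are richer than this.
    """
    t = (value - low) / (high - low)
    t = min(1.0, max(0.0, t))  # clamp values that fall outside the context
    degrees = {
        "small": max(0.0, 1.0 - 2.0 * t),
        "medium": max(0.0, 1.0 - abs(2.0 * t - 1.0)),
        "big": max(0.0, 2.0 * t - 1.0),
    }
    # Ties are broken by dictionary order (small, medium, big).
    return max(degrees, key=degrees.get)


def mine_associations(rows, contexts, antecedent, consequent,
                      min_support, min_confidence):
    """List (antecedent label, consequent label, support, confidence) tuples.

    Assumption: plain support/confidence counting stands in for the
    data-mining step; it is a placeholder, not GUHA.
    """
    labelled = [
        (label(row[antecedent], *contexts[antecedent]),
         label(row[consequent], *contexts[consequent]))
        for row in rows
    ]
    pair_counts = Counter(labelled)
    antecedent_counts = Counter(a for a, _ in labelled)
    total = len(labelled)
    rules = []
    for (a, c), count in pair_counts.items():
        support = count / total
        confidence = count / antecedent_counts[a]
        if support >= min_support and confidence >= min_confidence:
            rules.append((a, c, support, confidence))
    return rules


# Hypothetical data: two numeric attributes, each with context [1, 10].
rows = [
    {"x": 1, "y": 9}, {"x": 2, "y": 10},
    {"x": 9, "y": 1}, {"x": 10, "y": 2},
    {"x": 5.5, "y": 5.5},
]
contexts = {"x": (1, 10), "y": (1, 10)}
rules = mine_associations(rows, contexts, "x", "y",
                          min_support=0.3, min_confidence=0.8)
for a, c, support, confidence in rules:
    print(f"IF x is {a} THEN y is {c} "
          f"(support={support:.2f}, confidence={confidence:.2f})")
```

Each surviving tuple reads directly as an IF–THEN statement over linguistic labels, which is the property the abstract emphasizes; a faithful implementation would replace both the labelling and the mining step with the methods described in the paper.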

Similar resources

Mining Linguistic Information for Configuring a Visual Surface Inspection System

The configuration of a surface inspection vision system is a complex task that requires mining associations among attributes, due to the variability of the surface and the environment in a real-time production process. The surface inspection task has to change to deal with different elements such as wood, stainless steel or paper inspection and, in the case of stainless steel, with reflectance and th...

Fuzzy Weighted Data Mining from Quantitative Transactions with Linguistic Minimum Supports and Confidences

Data mining is the process of extracting desirable knowledge or interesting patterns from existing databases for specific purposes. Most conventional data-mining algorithms identify the relationships among transactions using binary values and set the minimum supports and minimum confidences at numerical values. Linguistic minimum support and minimum confidence values are, however, more natural ...

Fuzzy transform in the analysis of data

Fuzzy transform is a novel, mathematically well founded soft computing method with many applications. In this paper, we present this technique with applications to data analysis. First, we show how it can be used for detection and characterization of dependencies among attributes. Second, we apply it to mining associations that have a functional character. Moreover, the mined associations are c...

Data Mining with Linguistic Thresholds

Data mining is the process of extracting desirable knowledge or interesting patterns from existing databases for specific purposes. In the past, the minimum supports and minimum confidences were set at numerical values. Linguistic minimum support and minimum confidence values are, however, more natural and understandable for human beings. This paper thus attempts to propose a new mining approac...

YAPPIE — Learning information extraction patterns from unlabeled data

Motivation: A major goal in biomedical text mining is the extraction of biological entities, associations between them, and their respective mapping to database entries. One common and successful approach is to use sets of linguistic patterns that match, for instance, protein-protein interactions or gene-disease associations in a sentence. Pattern engineering is usually done by hand or relies o...

Journal:
  • Int. J. Approx. Reasoning

Volume 48  Issue 

Pages  -

Publication date 2008